Incomplete Multi-Modal Visual Data Grouping
نویسندگان
چکیده
Nowadays multi-modal visual data are much easier to access as the technology develops. Nevertheless, there is an underlying problem hidden behind the emerging multi-modality techniques: What if one/more modal data fail? Motivated by this question, we propose an unsupervised method which well handles the incomplete multi-modal data by transforming the original and incomplete data to a new and complete representation in a latent space. Different from the existing efforts that simply project data from each modality into a common subspace, a novel graph Laplacian term with a good probabilistic interpretation is proposed to couple the incomplete multi-modal samples. In such a way, a compact global structure over the entire heterogeneous data is well preserved, leading to a strong grouping discriminability. As a non-trivial contribution, we provide the optimization solution to the proposed model. In experiments, we extensively test our method and competitors on one synthetic data, two RGB-D video datasets and two image datasets. The superior results validate the benefits of the proposed method, especially when multimodal data suffer from large incompleteness.
منابع مشابه
Damage detection of skeletal structures using particle swarm optimizer with passive congregation (PSOPC) algorithm via incomplete modal data
This paper uses a PSOPC model based non-destructive damage identification procedure using frequency and modal data. The objective function formulation for the minimization problem is based on the frequency changes. The method is demonstrated by using a cantilever beam, four-bay plane truss and two-bay two-story plane frame with different scenarios. In this study, the modal data are provided nume...
متن کاملMulti-modal Primitives as Functional Models of Hyper-columns and Their Use for Contextual Integration
In this paper, we describe a biological motivated image representation in terms of local multi–modal primitives. These primitives are functional abstractions of hypercolumns in V1 [13]. The efficient and generic coding of visual information in terms of local symbolic descriptiones allows for a wide range of applications. For example, they have been used to investigate the multi–modal character ...
متن کاملGrouping and Segregation of Sensory Events by Actions in Temporal Audio-Visual Recalibration
Perception in multi-sensory environments involves both grouping and segregation of events across sensory modalities. Temporal coincidence between events is considered a strong cue to resolve multisensory perception. However, differences in physical transmission and neural processing times amongst modalities complicate this picture. This is illustrated by cross-modal recalibration, whereby adapt...
متن کاملFEM Updating for Offshore Jacket Structures Using Measured Incomplete Modal Data
Marine industry requires continued development of new technologies in order to produce oil. An essential requirement in design is to be able to compare experimental data from prototype structures with predicted information from a corresponding analytical finite element model. In this study, structural model updating may be defined as the fit of an existing analytical model in the light of measu...
متن کاملDisambiguating Multi–Modal Scene Representations Using Perceptual Grouping Constraints
In its early stages, the visual system suffers from a lot of ambiguity and noise that severely limits the performance of early vision algorithms. This article presents feedback mechanisms between early visual processes, such as perceptual grouping, stereopsis and depth reconstruction, that allow the system to reduce this ambiguity and improve early representation of visual information. In the f...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016